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Partial sum rules are widely used in physics to separate low- and high-energy degrees of freedom 
of complex dynamical systems. Their application, though, is challenged in practice by the always 
finite spectrometer bandwidth and is often performed using risky model-dependent extrapolations. 
We show that, given spectra of the real and imaginary parts of any causal frequency-dependent 
response function (for example, optical conductivity, magnetic susceptibility, acoustical impedance 
etc.) in a limited range, the sum-rule integral from zero to a certain cutoff frequency inside this range 
can be safely derived using only the Kramers-Kronig dispersion relations without any extra model 
assumptions. This implies that experimental techniques providing both active and reactive response 
components independently, such as spectroscopic ellipsometry in optics, allow an extrapolation- 
independent determination of spectral weight 'hidden' below the lowest accessible frequency. 



INTRODUCTION 

Global sum rules applied to response functions play a 
major role in physics as they quantitatively express fun- 
damental conservation laws. Of interest are also partial, 
or restricted sum rules, where a properly chosen cutoff fre- 
quency separates low- and high-energy degrees of freedom 
of a physical system. For example, the low-frequency op- 
tical spectral weight [l[ 



(1) 



is a partial counterpart of the well known /-sum rule 
J Q a\(w)du) — nne /(2m e ) for the optical conductivity 
o~(u>) — <J\ (uj) + icr^ui), where n is the density of charges, 
e and m e are the charge and the bare mass of electron. 
In charge conducting systems, the integral up to a cut- 
off frequency lu c somewhat larger than the free-carrier 
scattering rate but below the energies of transitions from 
occupied to empty bands is proportional to the number 
of carriers and the inverse band mass averaged over the 
Fermi surface. For example, in the simple Drude model, 
■ integrating out to 10 (20) times the scattering rate re- 
covers 94% (97%) of the sum rule. 

The effective mass can be strongly affected by elec- 
tron correlations, especially in a case of narrow band- 
width. The dependence of W(uj c ) on temperature and 
other parameters, especially across a phase transition, is 
thus a valuable piece of information about the changes 
in the electronic system. The changes of W(oj c ) can be 
very large, as at the ferromagnetic - paramagnetic tran- 
sition in colossal magneto-resistance manganites @, 0], 
or rather small as at the superconducting transition in 
the high-T c cuprates 0, d, 0] . Even in the latter 
case, the subtle variations of the low-frequency spectral 
weight may potentially distinguish between physically 
different scenarios of superconductivity If a signif- 
icant spectral overlap between the free charge and inter- 
band peaks is present, the intrepretation can be less obvi- 
ous and requires direct theoretical calculation of the value 



of W(oj c ). Therefore it is worth using all means, exper- 
imental and computational, to improve the accuracy of 
W(lu c ) determined from the available spectra. Notably, 
the partial sum rules can be meaningfully applied not 
only to the optical spectra, but also in acoustical data, 
neutron scattering and other spectroscopic techniques. 

Because of limitations on the bandwidth of any spec- 
trometer, o~i(w) is not experimentally available down to 
zero frequency. The direct application of Eq.([T]) assum- 
ing some low- frequency extrapolations of a\ (u>) may lead 
to significant and uncontrollable error bars. This is most 
obvious for the optical conductivity of a superconduc- 
tor, where the spectral weight of the condensate of the 
Cooper pairs is represented in o~i(u>) by a 5-peak at zero- 
frequency. Here the inductive component 02 (w) has to be 
used to estimate the condensate spectral weig ht 
However, the common procedure used in this case still 
requires the extrapolation down to zero frequency. 

Importantly, certain experimental techniques, such as 
spectroscopic ellipsometry, or simultaneous measurement 
of acoustical attenuation and the sound speed allow di- 
rect independent measurement of both components of the 
response function. The purpose of this article is to show 
that the spectral weight W(ui c ), including a possible zero- 
frequency (5-peak, can be obtained model-independently, 
i.e. without any a priori assumptions about the low- 
and high-frequency spectral behavior, if both <j\{u>) and 
172(0;) are measured in a limited frequency range [w m i n , 
w max ]. We also present an efficient numerical algorithm 
optimized to reduce the output error bars in the case of 
the noisy data. 



ON THE POSSIBILITY OF ANALYTICAL 
CONTINUATION FROM A FINITE INTERVAL 



The causality principle, i.e. the assumption that no re- 
sponse can precede the causing factor, puts constraints on 
the analytic behavior of all physical response functions, 
such as the optical conductivity o~(lo) = o~i(u)) + 10-2(0)). 
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In particular, these functions obey the Kramers-Kronig 
(KK) dispersion relations: 



co Q co +A 



to 
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where p denotes the principal-value integral. Note that 
relations |J2J| and ([3]) are entirely model-independent since 
they only use the fact that the response function is ana- 
lytical in the upper complex semiplane. One can say that 
the information about cri(w) is encoded in 02 (w) and vice 
versa. 

The determination of the partial sum-rule integral ([1]) 
is intimately related to a more general problem to restore 
the function a(u) itself outside the experimental range. 
It is well known jll| that a complex function <j{uj) analyt- 
ical (holomorphic) in a certain domain D can be analyt- 
ically continued from a subset T of the boundary of this 
domain into the whole domain, including the rest of the 
boundary. The specific form of such a continuation has 
been a subject of numerous studies since the late 1920's 
[l3j ]. which are summarized in Reffl2l. In particular, the 
Carleman-Goluzin-Krylov formulas restore the function 
exarffaflij] inside the analyticity domain from its values 
at r. They have the following general structure: 



lim / Q n (uj' , oj)a(uj')dui' , oj £ D. (4) 

n — >oo 



There are several possibilities to choose the kernel 
Q„(uj' ,lu). We point out just one option for the case 

T = [tJmin, W max ] PJ] 



Q n {u',w) = 



(27U)" 1 



to 



0(w 











(5) 



that illustrates the general property of these kernels: they 
oscillate as a function of both variables, with the fre- 
quency of oscillations increasing infinitely as n increases. 
Therefore the 'lim' operation cannot be applied directly 
to Q n (uj',u>) as it only exists for the whole integral ((4|). 
Using Eqs. ((3]) and one can, in principle, obtain the 
spectral weight 



W{u> c ) = Re { lim 



where u r 



u n (u)' \uj c )a(uj')dw' > (6) 



(w', w c ) = J* c Q n (u',u)duj. 
In spite of the fact of its formal existence, the ana- 
lytical continuation of a function from a finite interval 
appears to be a very ill-posed problem. For example, 
one can construct an analytical function, which is al- 
most zero in the range [o; m i n , w max ] but shows intense 
spectral structure below u> m - m (Fig. [1]). Let us consider 
<r(uS) to be a sum of two narrow Lorentzians: the first 
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FIG. 1: (Color online) An example of <7i(u) and 0-2(01) satis- 
fying KK relations which are both almost zero at [a) m in,Wmax] 
but show intense spectral features below o> m i n . The 'unphysi- 
cal' negative values of <7i(o>) can be meaningful, for example, 
if 01 (w) is an addition to some background response function 
so that the result is positive. 



one with spectral weight A centered at ujq below uj m i n 
and the second one with the opposite spectral weight 
—A slightly displaced to u Q + A, still below w m ; n . Nor- 
mally, <7i(oj) cannot be negative; however, in this ex- 
ample one can think of <?i(u}) as an addition to some 
positive background response function, which makes the 
result always positive. A similar function would appear, 
in particular, if one takes a difference between two op- 
tical conductivities of the same sample at two different 
temperatures in a case when the conductivity contains a 
single optical phonon peak which shifts as a function of 
temperature (assuming that the phonon spectral weight 
remains unchanged) .The width of the peaks is assumed to 
be much less than A). The corresponding 02(0;) far from 
ujo would be approximately (— AAujoujA/tt)(uj 2 — u>q)~ 2 - 
By decreasing A, both ai(v) and 02(0;) can be made 
vanishingly small in the range [aj m i n , w max ] . Obviously, 
the slightest noise on top of a(w) that would make it in- 
distinguishable from zero prevents the extraction of the 
mentioned strong structures beyond the accessible range. 

In the considered example it was essential that the 
spectral weights of the two peaks exactly compensate 
each other in order to get the vanishing values of (J\{uj) 
and 172(0;) at [w m i n) w max ]. Otherwise one would get a 
detectable term ~ X/oj in 02(0;) proportional to the to- 
tal low-frequency spectral weight [H, [l6j]. This indi- 
cates that the 'hidden' spectral weight is much better 
determined by the limited-range data than the function 
ojjui) itself. This is supported by the study of Aspnes 
[17] . who examined the possibility to extrapolate an el- 
lipsometrically measured dielectric function beyond the 
experimental range using the Kramers-Kronig relations 
and found that the total spectral weights of a few broad 
spectral regions can be restored reasonably well while 
the high-resolution details of the spectra cannot be un- 
ambiguously determined. Some analytical treatments of 
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the problem of the finite frequency range can be found 



in Refs. 



should also take care that the error bar of the resulting 
spectral weight due to the noise 



A PRACTICAL ALGORITHM FOR THE NOISY 
DATA 

We are interested in an efficient and accurate numerical 
scheme to determine the sum-rule integral ([!])). It turns 
out that the straightforward application of the formulas 
(H]) and (|6|) with strongly oscillating kernels to real data 
is not practical as it amplifies uncontrollably the exper- 
imental noise. In this case one has to look for different 
numerical algorithms. 

The experimental spectra are collections of data points 
a \ j ± 5a\j and a^j ± <5<72j on a mesh of frequencies uij 
(j = l,..,N). We assume that the spectral resolution 
is roughly the same as the distance between neighboring 
points. According to Eq. W{uj c ) is a linear function 
of the real and imaginary parts of a. We note that this 
directly follows from the fact that the KK relations are 
linear and is independent from the particular scheme of 
analytical continuation. Hence it is logical to take the 
following formula for the calculations: 



W{w c 



N 

E 



«l,7 cr l, 



(7) 



which is the most general linear relation between W(oj c ) 
and the measured values aij and <J2,j- The coefficients 
Ui t j and U2,j that we call hereafter u-coefficients have to 
be chosen in such a way that formula j7]) is approximately 
correct for any arbitrarily chosen response function. Since 
any causal response function can be represented as a lin- 
ear superposition of narrow oscillator response functions 
S x (to) = Si x (to) + iS2x{^) centered at all frequencies x, 
one should optimize the u-coefficients in such a way that 
it gives a reasonably accurate answer when applied to 
er(cij) = S x (uj) for any x. For S\ x {oj), one can take a 
narrow peaked function, for example a Gaussian, while 
S2x{u) should be the KK transform of S\ x (uj). 

We introduce a function D(x), which is the inaccuracy 
of the formula ([7J when applied to S x (ui) 

rue N 

D(x) = / S lx {uj)duj - [uijS^iujj) + u 2 jS 2x {uJj)} 
The integrated inaccuracy can be defined as follows 

p CO 

Df nt = / D 2 (x)dx. (8) 
Jo 

In order to make the formula ([7]) accurate for all x at the 
same time, one would need to minimize D 2 nt by varying 
the u-coefficients. If experimental noise is present, one 



F 



N 

E 



(9) 



does not become too large. In general, D 2 nt and F 2 have 
to be minimized simultaneously. A more detailed de- 
scription of this algorithm is given in the Appendix 1. 

We implemented this idea in a numerical code Devin 
[21], which takes a set of data points of ai(cu) and 02(0;) 
with error bars in a limited range [w m i n , w max ] and re- 
turns the estimated value and error bar of W(u> c ) for 
specified cutoff frequency ui c . It is assumed that the real 
and imaginary parts of the response function are KK- 
consistent; otherwise output error bars are unpredictable. 
Note that the KK consistency of the limited-range data 
can be tested using exact bounds proposed in Ref . (lij . 



RANDOM TESTS OF THE METHOD 

A series of Monte-Carlo tests were performed where the 
program answer based on limited-range spectral informa- 
tion can be compared with the exact value of W(u) c ), al- 
lowing one to scrutinize the model independence of the 
method. We generate a random KK-consistent func- 
tion a(uo) by adding up a random number of peaks of 
random spectral weights widths 7^ and center fre- 
quencies uiQ.k distributed below, inside and above the 
range [w m i n , ui max \. In particular, a sum of Lorentz peaks 
plus high-frequency background was taken[22j|: <j(u>) = 
J] fe (-2iAfew/7r)(wo ;fc -w 2 -i7few) _1 -?eoow/(47r), where 
the number of peaks is changed between 1 and 50. The 
parameters were varied in the following limits: Ak - be- 
tween and 125, "fk - between and 3, ujq^ - between 
and 5, - between and 5. The first oscillator 
was always a delta function at zero frequency (wq,i = 0, 
71 =0), imitating a condensate peak in superconductors. 
The 'experimental' data points sent to the program were 
generated with a step of 0.1 inside the range [w m i n , w max ] 
by convoluting the true functions with a Gaussian of 
width 0.1 to mimic the finite resolution and adding ran- 
dom noise with standard deviation of 8a. Each test se- 
ries consisted of 30 independent generations of cr(w) and 
a comparison of the true and calculated W(lu c ). 

The test results are summarized in FigJ^J The series 
(a)-(c) demonstrate the sensitivity of the extracted spec- 
tral weight to the data noise. The experimental range [1, 
4] and the cutoff u) c = 1 are the same, while 5a varies 
from 0.5 to 10. One can see that for sufficiently small 
noise the program is able to provide quite accurate value 
of W(u c ) for all 30 random inputs, which demonstrates 
that the method is indeed model- independent. On the 
other hand, relatively large error bars in series (b) and 
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FIG. 2: (Color online) Series (a)-(g) of Monte-Carlo tests of 
the numerical algorithm that calculates the spectral weight 
W(ui c ) knowing only (Ti(u>) and a^iio) in a limited range 
[^min , Wmai] • Left panel: the correlation between the 'true' 
spectral weight Wtrue(^c) and the calculated one W ct ,i c (w c ); 
each point corresponds to one randomly generated KK con- 
sistent response function as described in the text. The error 
bars are as estimated by the program. Green lines refer to 
W ca i c (uj c ) — Wtrue(u c )- Right panel: the curves 171,2(0;) cor- 
responding to the point marked in red color on the left panel. 
Solid lines are the 'true' functions, dots are the data points 
(with noise 8a added and finite resolution applied) used as 
input. The values of u> m ax, u c and Sa corresponding to each 
series are given on the left panels (o; m j n = 1 in all series). 



(c) show that the requirements to the signal-to-noise ra- 
tio for this particular set of data points are quite strict 
(~ 1 %), although not unrealistic. 

Series (a), (d) and (e) give a feeling of how the accuracy 
of this procedure depends on the width of experimental 



range at a constant noise amplitude. The range was con- 
secutively narrowed from [1,4] to [1,2]. The range width 
appears to be a critical factor determining the method 
precision. The output error bars increase rapidly as we 
restrict the experimental range - much faster, in fact, 
than what one would normally expect due to a simple 
decrease of the number of data points. 

By comparing series (a), (f) and (g) one can see that 
the error bars increase dramatically if the cutoff lo c is 
taken beyond the experimental range, both below uj m - ul 
or above w max - This is another indication that one can 
reliably determine the total 'hidden' spectral weight but 
not a part of it since the latter depends on the inaccessible 
spectral details of <J\(uo). 
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FIG. 3: (Color online) The application of the partial sum- 
rule analysis to optical conductivity (a) of optimally doped 
Bi2Sr2Ca2Cu30io at room temperature for electric field par- 
allel to CUO2 planes (Ref 23). All frequencies and conduc- 
tivities are divided by 2-7rc in order to get the wavenumber 
units of cm -1 . Above 6000 cm -1 , a\{ui) and 02(0;) were mea- 
sured directly by spectroscopic ellipsometry; below 6000 cm" 
they were derived with error bars from simultaneous KK- 
constrained fit of reflectivity [24| . measured down to 100 cm -1 
and the ellipsometric spectra, (b) The value of W(u) c ) calcu- 
lated for a; c =8000 cm -1 as a function of u m j n for aj ma x=20000 
cm -1 , (c) The value of W(uj c ) for w c =8000 cm -1 as a function 
of Wmai for o> m i n =100 cm . The error bars are indicative, 
as estimated by the program. 
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REAL-DATA EXAMPLE 

In Fig |3] the application of this program to real data is 
demonstrated. As an example, we calculate the sum-rule 
integral for the optical conductivity of a high-T c super- 
conducting compound Bi2Sr2Ca2Cu30io, based on data 
published in Refl23l. We use a set of data which spans 
the interval of wavenumbers lu/(2ttc) between 100 cm -1 
and 20000 cm -1 . In Fig|3h> each point corresponds to a 
value of W(u> c ) calculated on the basis of input data from 



=20000 cm as a function of ui n 



The er- 



ror bars are relatively small as long as a-> m ; n is less than 
u> c , but grow explosively as w m ; n exceeds u> c . In Figj3j: 
each point corresponds to a value of W(uj c ) calculated 
on the base of input data from w m j n =100 cm -1 to ui max 
as a function of w max . When ui c is above the highest 
frequency point, the error bars again tend to diverge. 



DISCUSSION AND CONCLUSIONS 

One should note that the the involved algorithm that 
we used to extract the partial sum-rule integral W(lu c ) is 
not the only possibility to tackle this problem. Perhaps 
a more conventional and intuitive method is to fit the 
available spectra with a KK-consistent multi-parameter 
function, such as the Drude-Lorentz model or the vari- 
ational function of Ref.[24j. A similar method of the 



Kramers-Kronig integrals was used by Aspnes [17| . 

Interestingly all fitting schemes where the model func- 
tion is a linear superposition of the trial basis functions 
eventually reduce to the same type of linear formula ([7]), 
as demonstrated in the Appendix 2. However, there is 
an important difference between the two approaches. In 
the method that we used in this paper, the u-coefficients 
are optimized in order to minimize the output error bars 
5W(u) c ), while in the fitting approach they are predeter- 
mined by the specific set of trial basis functions, which 
may not be always optimal. In this sense, our approach 
is more general and model- independent. 

On another hand, the data fitting approach can be 
straightforwardly applied also to experimental quantities 
that depend on (J\{lu) and a^w) in a non- linear way, for 
example, the optical reflectivity. Another advantage of 
the data fitting technique is that it can detect if (7i(uj) 
and <T2 (w) are KK inconsistent, for example, due to the 
systematic experimental uncertainties. Thus it is prefer- 
able to use both techniques in a combination. 

In summary, we have shown that the partial sum-rule 
analysis can be accurately performed on the basis of the 
real and imaginary parts of a response function, if the 
latter is experimentally available in a limited spectral 
range. In a sense, nature integrates for us t7i(w) beyond 
the accessible range and encodes information about this 
integral in a^w) inside the range where the experimental 
data is available. We have shown that it can be decoded 



using a simple linear formula ([7|) with optimally chosen 
u-coefficients. The determination is accurate only if the 
cutoff frequency is lying inside the accessible interval. 
Even though the interval can be, formally speaking, ar- 
bitrarily small, the extraction of the sum-rule integral for 
a narrow interval would require much better data accu- 
racy to obtain equivalent precision of W(uj c ) than for a 
broad one. We find, however, that the error bars are not 
uncontrollably large due to the notorious extrapolation 
uncertainty. They can be, at least in principle, made 
arbitrarily small by decreasing the experimental noise. 
This result is valid for all response functions satisfying 
the KK relations and thus applies to various domains of 
spectroscopy. 
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APPENDIX 1. OPTIMIZATION OF THE 
U-COEFFICIENTS 



Here we describe a method that we use to optimize the 
u-coefficients in the approximate formula 



N 



(10) 



v=\ j=l 



The starting point is the spectral representation of the 
response function: 



<t(lo) — p / <Ji(x)S(uj,x)dx, 



(11) 



where S(uj, x) = S(lu + x) + S(uj — x) + (i/ir)[(bj — a;)" 1 + 
(uo + x)^ 1 ] is a response of an oscillator centered at fre- 
quency x (and at —x to preserve the parity of the re- 
sponse functions). Put differently, any causal response 
function can be represented as a linear superposition of 
narrow oscillator functions. In reality, one has to con- 
volute S(lu,x) with an apparatus function A(lu,lu'), for 
example, a Gaussian, which takes the experimental res- 
olution into account: S(uj,x) — J A(uj,u)')S{<jj' , x)dw' . 
The key idea of the method is based on the linearity of 
Eq. (TO)) : in order to make this formula applicable to 
any <t(cj), one should optimize the u-coefficients in such 



G 



a way that it gives a reasonably accurate answer when 
applied to <t(lu) = S(to,x) for any x. 

To be quantitative, one can calculate a discrep- 
ancy function D(x) for a given combination of the u- 
coefficients 



and 



where 



D(x)=W s {x)-W s ,a(x), 



W s (x) = / S x {u,x)du 
Jo 

2 N 

W Sta (x) = ^^UvjSvjujj,: 

v=\ 3=1 



One can also define the discrepancy integrated over all x 



D 



D 2 {x)dx. 



(12) 



In order to make the formula (|10p accurate for all x si- 
multaneously (assuming that it is possible!), one should 
minimize the integral discrepancy by varying the u- 
cocfficients. This is easy since D 2 nt ({uij}, {u 2 j}) is a 
quadratic form 

2 N 2 N 

f,/x=l u—1 i—1 

where the coefficients are given by the integrals over x: 



A . . — 
B v .i — 
C = 



S u (uj i ,x)S u (uJ : j,x)dx, 
S„(u} i: x)W s (x)dx, 



(13) 



The minimization of D 2 nt reduces to the linear system of 
equations: 



2 N 
^=1 j=l 



In a case when the (frequency-dependent) spectral res- 
olution S(ui) is given by a Gaussian 



A(w,w') = 



1 



27n5(w 



■ exp 



(cj-u/) ; 
25 2 (w) 



the above integrals can be taken analytically, which sim- 
plifies dramatically the calculations: 



Si (a;, x) 
#2(0;, x) 
W s {x) 



1 



1 

1 
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LO + X 
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uj + x 
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.93 



a; c + a; 



/ cj c — a; 



\V26(uj c )) +93 Vx .) 



A-22,ij 



.91 



.92 



.92 



.91 
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.92 



-,9i 



B2,i 

C 



.93 

,9i 



.93 



.91 



where W ± = ( W < ± Wi )/2, = [<S 2 (^) + ^K)]/2, w± = 
(wj ± w c )/2, Sf c = [6 2 {u; z ) + 6 2 {u c )}/2 and S c = 5(w c ). 
Here we used auxiliary functions 

gi(x) = TT- 1/2 exp(~x 2 ) 

g 2 (x) = tt^ 1 / 2 exp(-x 2 ) erfi(cc) 

g 3 (x) = erf (a;) 

ff 4 (aO = 2^- 1 a; 2 2J F 2 (1,1, 3/2,1, -x 2 ) 

55 (») = 27r" 1/2 [exp(-a; 2 )-l] + 2a;erf(x), 

where erf(x) is the error function, crfi(x) = eif(ix)/i 
the imaginary error function and 2^2 (.-,2;) the hyperge- 
ometric function. 

So far we ignored the experimental uncertainty of 
ax(u)) and (72 (ui), which is another source of the output 
error bars. If we assume that it is just random noise with 
the standard deviations at each frequency given by Saij 
and 5(72,3 then the standard deviation of the output of 
Eq.m> is: 



2 N 



(14) 



Our experience shows that the optimization of D 2 nt 
alone may provide very inaccurate results for a noisy in- 
put. Instead, the minimization of a compound functional 
D 2 nt + wF 2 , where w is a weighting coefficient which is 
discussed later, works much better. From a numerical 
point of view, adding wF 2 enhances the diagonal ele- 
ments of the matrix A: 
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and makes it better conditioned. This addition prevents 
the u-coefhcients from growing too much, therefore one 
can consider it as a regularization term. 

The coefficient w describes the relative significance of 
the data noise compared to the inaccuracy of the linear 
Eq . (fT0|) . The subtlety is that the optimal value of w is 
determined by W(u> c ) and is thus not known a priori. 
A way out is to use a second optimization loop for w. 
As a criterion, it is logical to minimize the estimated 
total error bar (SW) 2 , i.e. caused by both the formula 
uncertainty and the input noise. 

A simple estimate that we use (which can be perhaps 
improved) is the following. For a given set of the u- 
coefficients, one can find W a , F 2 and D 2 nt . Since the 
two error sources are independent, one can approximately 
determine the range of possible values of the true spectral 
weight W by the inequality: 



(W - W a ) 2 < F 2 



KDl t W 2 /u c , 



(15) 



where k is a number of the order 1-10. The goal of this 
parameter is to adjust the accuracy of the rough estimate 
(|15p . The variation of k by one order of magnitude does 
not significantly affect the value of Wq , although it does 
modify SW. We found that k — 5 gives the best results in 
the numerical tests described in the main text. It is likely 
that more accurate estimates can remove an ambiguity 
here. 

The inequality (fT5")) can be resolved with respect to W 



Wq-SW <W <W + SW. 



(16) 



where (b = nD 2 nt /u) c ): 
Wo = 
SW = 



W a 



1-6 

^bW 2 + (1 - b)F 2 
l-b 



After the numerical minimization of (SW) 2 as a function 
of w, the u-coefficients become dependent on ai j and 
(T2 j - For example, in a case of a very large value of W, 
the relevant importance of the input error bars is small 
and the optimal it-coefficients tend to oscillate stronger 
than in a case of small W. 

Fig. [4] shows some examples of the u-coefficients op- 
timized as described above and the corresponding dis- 
crepancy function D(x) for different sets of experimental 
frequencies and noise levels. Clearly, the spectral dis- 
crepancy function can be made quite small for all a; by a 
proper optimization. The comparison between panels (a) 
and (b) tells that the discrepancy function is smaller if 
the range of experimental frequencies is broader. On the 
other hand, from panels (b) and (c) one concludes that 
taking error bars into account makes the u-coefficients 
much smaller, which results in a somewhat larger D(x) 
but in a better overall accuracy (SW) 2 (not shown). 



40 

20 



-20 

-40 
100 

50 



-50 

-100 

4 

2 



-2 

-4 



.(a) 


■ i 1 i . 

i — u ■ 


H 

- 


■ 

i , i - 


i ■ i ■ i 
: (b) J\ : 


Hh 


■(c) 


■ i ■ i 
■■ 


H 

■ 

\ 


i . i 



1.0 

0.5 

0.0 
1.0 

0.5 

0.0 

1.0 

0.5 
0.0 h 



1 1 


i ■ i 

w - 

s 

— w 

sa 

D 






1 


i 1 i 

\ ~ 


: \ 






i < i 


; - \ 


i.i.i 



1.2. 3 

to (a.u.) 



12 3 

x (a.u.) 



FIG. 4: The optimized u-coefficients (left panel) and 
the corresponding functions W s (x), W S)a (x) and D(x) = 
W s (x) — W s ,a(x) (right panel). Everywhere w c =1.5, Lo m - ln =l, 
5(o»)=0.1. (a) Wmax=2, no data noise; (b) w m ax=3, no data 
noise (c) oj max =3, some data noise is added. 



APPENDIX 2. THE EQUIVALENCE OF THE 
LEAST-SQUARE FIT OF AND a 2 (io) AND 

THE LINEAR APPROXIMATION FORMULA 



The most straightforward way to determine spectral 
weight is to fit experimental data with a model function 



M 

CmodM = ^ CfcQ-fc (ui). 
fe=l 



(17) 



This is a linear superposition of some basis functions 
crfe(w) = ai t k(w) + icf2,k(^), Lorentzians, for example, 
each of which satisfies the KK relations. Then one can 
estimate the partial sum-rule integral, using this model: 



W(lo c ) 



M 

ai. mod (uj)duj = 22 c k W k , 
fc=i 



(18) 



where W k = J^ c iri t k(u)du- 

Fitting the data points {u)j,a v j,5a u> j} (v = 1,2, j = 
1..N) with the model function (fT7)) in the least-square 
sense means the minimization of 



X 



2 N 
v— 1 7 — 1 v '*■ 



0~v,j — ^ CkO- v< k(tjJj) 



k=l 



8 



with respect to the coefficients This gives a linear 
system of equations 



M M 

^ aikCk = h, or c fc = y ^{a~ 1 )ikh, 
fe=i 



(19) 



z=i 



where 



a-ik 



2 N ~ i \~ i \ 



2 JV ~ 



(5^)2 



(20) 



Substituting Eq.|l9]) and ([20]) into Eq. ([181) we ob- 
tain, after some transformations, an expression identical 
to Eq.JlOl): 



2 N 

with the u-coefficients independent of <J v ,j' 
1 M 

U »<J = \2 H ( a ~\kVv,k{uj)Wi. (22) 

This implies that the strategy to obtain W(lo c ) by fit- 
ting eri(w) and 172(0;) is nothing else but the application 
of the linear formula (fTQ|) with specific u-coefficients de- 
termined by the basis of functions ak(co). 



* Permanent affiliation: Dept. of Physics, University of 
Alberta, Edmonton, AB, Canada, T6G 2J1. 
[1] F. Wooten, Optical Properties of Solids, Academic Press, 
New York, 1972. 



[2] Y. Okimoto et al, Phys. Rev. Lett. 75, 109 (1995). 

[3] A.J. Millis, J. Electron. Spectrosc. Rel. Phenom. 114- 

116, 669 (2001). 
[4] D. N. Basov et al, Science 283, 49 (1999). 
[5] H.J.A. Molegraaf et al., Science 295, 2239 (2002). 
[6] A. F. Santander-Syro et al, Europhys. Lett. 62, 568 

(2003). 

[7] A.V. Boris et al, Science 304, 708 (2004). 
[8] J. E. Hirsch, Physica C 199, 305 (1992). 
[9] M. Tinkham, Phys. Rev. 104, 845 (1956). 
[10] S. V. Dordevic et al, Phys. Rev. B 65, 134511 (2002). 
[11] T.M. MacRobert, Functions of a Complex Variable 

MacMillan, London (1958), p. 122. 
[12] L. A. Aizenberg, Carleman's Formulas in Complex Anal- 
ysis, Kluwer Academic Publishers, (1993). 
[13] T. Carleman, Les fonctions quasianalytiques, Gauthier- 
Villars, Paris (1926); G. M. Goluzin and V. I. Krylov, 
Mat. Sb. 40, 144 (1933) (in Russian). 
[14] The exact conditions on o-(lj) for the validity of the 
Carleman- Goluzin- Krylov formula are given in Ref . fl2T| . 
For the present discussion it is important that all physi- 
cal functions with the poles below the real axis would fit 
these requirements. In the case of the optical conductivity 
of a superconductor, which has a pole exactly at the real 
axis, one can shift the pole slightly down. This would 
only broaden the 5-peak by an arbitrary small amount 
but simultaneously provide the necessary condition. 
I. Bozovic, Phys. Rev. B 42, 1969 (1990). 
A.B. Kuzmenko et al, Phys. Rev. B 72, 144503 (2005). 
D.E. Aspnes, Phys. Rev. B 12, 4008 (1975). 
R. Hulthen, J. Opt. Soc. Am. 72, 794 (1982). 
G.W. Milton and D.J. Eyre, and J.V. Mantese, Phys. 
Rev. Lett. 79, 3062 (1997). 

A. Dienstfrey and L. Greengard, Inverse Problems 17, 
1307 (2001). 

http: / /optics. unige.ch/Devin2 1 



[15] 
[16] 
[17] 
[18] 
[19] 

[20] 



[21] On-line robot version 



[22] 



[23] 
[24] 



can be used for further tests. 
In these tests, uj and a are taken in their respective (ar- 
bitrary) units; the unit of W(lj c ) is the product of the 
two. 

F. Carbone et al, Phys. Rev. B 74, 024502 (2006). 
A.B. Kuzmenko, Rev. Sci. Instrum. 76, 083108 (2005). 



